Coarse-grained lattice model for investigating the role of cooperativity in molecular 

recognition 
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Equilibrium aspects of molecular recognition of rigid biomolecules are investigated using coarse- 
grained lattice models. The analysis is carried out in two stages. First an ensemble of probe 
molecules is designed with respect to the target biomolecule. The recognition ability of the probe 
ensemble is then investigated by calculating the free energy of association. The influence of cooper- 
ative and anti-cooperative effects accompanying the association of the target and probe molecules is 
studied. Numerical findings are presented and compared to analytical results which can be obtained 
in the limit of dominating cooperativity and in the mean-field formulation of the models. 

PACS numbers: 87.15.-v, 87.15.Aa, 89.20.-a 



I. INTRODUCTION 

Molecular recognition is the ability of a biomolecule to 
interact preferentially with a particular target molecule 
among a vast variety of different but almost identically 
looking rival molecules. Examples of specific recognition 
processes comprise enzyme-substrate binding, antibody- 
antigen binding, protein-receptor interactions or cell- 
mediated recognition [H, Q- Molecular recognition is es- 
sential for biological systems such as the immune system 
to work efficiently. Whereas macromolecules are held to- 
gether by covalent bonds the recognition process is gov- 
erned by specific non-covalent interactions such as ionic 
binding, the van der Waals interaction the formation 
of hydrogen bonds and hydrophobicity In an aque- 
ous environment those non-covalent bonds contribute an 
energy of the order of 1-2 kcal/mole with the relatively 
strong hydrogen bonds sometimes contributing up to 8- 
10 kcal/mole The non-covalent bonds are thus only 
slightly stronger than the thermal energy k^T^^^^ ~ 0.62 
kcal/mole at physiological conditions and therefore the 
specificity of biomolecule recognition is only achieved if 
a large number of functional groups of the two molecules 
to recognise each other precisely match and thus a large 
number of non-covalent bonds can be formed The 
binding sites of the two molecules are said to be com- 
plementary to each other. This view of molecular recog- 
nition for infiexible macromolecules is sometimes called 
"lock-and-key" mechanism However there are notable 
recognition processes that involve fiexible biomolecules 
Q • The matching of a large number of functional groups 
is then achieved by a conformational change giving rise 
to large entropic contributions (so-called "induced fit" 
scheme) [H]. In addition to short-range interactions en- 
suring the stability of the complex for a sufficiently large 
time long-range electrostatic interactions are believed to 
pre-orient the biomolecules so that the probability of 
the contact of the complementary patches on the two 
molecules upon collision is increased [l3| • 

An understanding of the principles of recognition pro- 
cesses between biomolecules is not only important from 
a scientific point of view but also for biotechnological 



and biomedical applications. The knowledge of these 
principles is a necessary input for the design of syn- 
thetic heteropolymers with molecular recognition ability 
so that they can interact with a biological environment, 
i.e. biomolecules, cells and tissues, in a programmable 
way (see e.g. the review by 0)- 

In recent years much effort has been spent to investi- 
gate the structural basis for the recognition of two rigid 
proteins 0, S, [HI, E [H, [H- In particular the recogni- 
tion sites of the two proteins in contact have been anal- 
ysed. The recognition site on a protein basically consists 
of the residues, i.e. amino acids, which interact with 
residues of the other proteins. It is found to be made up 
of largely hydrophobic residues so that its hydrophobic- 
ity is comparable with that of the interior of the protein. 
For the development of idealised coarse-grained models it 
is therefore assumed that hydrophobicity plays a major 
role in recognition processes. Consequently the residue 
interactions in the ideahsed models investigated in this 
article are assumed to be purely of hydrophobic nature. 

The investigation of the underlying mechanisms of 
molecular recognition processes from a physical point of 
view has recently gained growing interest. In particu- 
lar the question of the specificity of recognition processes 
has been addressed by methods of statistical mechanics 
[il, M, M, H M, M, M, H HI, [iil . Nevertheless, 
the view of specificity, which is basically the occurrence 
of a preferential binding of the recognition agents in the 
presence of a diversity of rival molecules, remains yet in- 
complete (the introductory remarks in [27| about the di- 
versity of definitions of specificity found in the literature 
still apply). 

In this article we develop coarse-grained lattice models 
for the investigation of the principles of molecular recog- 
nition processes. Our approach, which is described in 
section [ni consists of two stages: In a first step a design 
of probe molecules is carried out. This step mimics the 
design in biotechnological applications or the evolution in 
nature. In a second step the recognition ability is calcu- 
lated by considering the free energy of association of the 
probe molecules with the target and a structurally differ- 
ent rival molecule. This general approach is illustrated 
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for a modified hydrophobic-polar (HP) model in section 
mil In section |IV] the modified HP model is extended 
to take cooperative effects on a residue-specific level into 
account. The resulting model is investigated in its mean- 
field formulation and in the limiting case of dominant 
cooperativity which can be tackled analytically. In ad- 
dition the model is investigated numerically for the case 
where the contributions of the direct residue-residue in- 
teractions and the indirect cooperative interactions are 
of same order. The findings are compared to the Hmiting 
case of dominating cooperativity and to the mean field 
findings. In section [Vl another possible way to incorpo- 
rate cooperative effects is analysed. The article closes 
with a conclusion and an outlook (section lyi]) . Among 
other findings some of the numerical results have been 
published recently in a separate Letter [2^ . 



II. GENERAL APPROACH 

In this article we study coarse-grained models for the 
recognition of two rigid proteins. Under physiological 
conditions the complex of the proteins is stabiHsed by 
non-covalent interactions across its interface. The bind- 
ing of the proteins is accompanied by a decrease of en- 
tropy due to immobiHsing translational, rotational and 
conformational degrees of freedom. The gain in energy 
on forming the complex has thus to be strong enough to 
overcome these entropic costs. In our model the proteins 
are considered to be rigid so that conformational changes 
of the backbone of the proteins need not be taken into 
account. This assumption is fulfilled for a large variety 
of real protein-protein associations, however, even for the 
association of two rigid proteins minor rearrangements of 
the side chains of the amino acids will occur (e.g. [lH). 

The energetics at the contact interface of the com- 
plex can be formulated in a coarse-grained way where 
coarse-graining is adopted for both the structural prop- 
erties of the recognition sites of the involved biomolecules 
and for the interaction between two residues jH]. Con- 
sider a recognition site of N residues on both proteins. 
For simplicity it is assumed that the two recognition sites 
contain the same number of residues and that precisely 
two residues match respectively in the interface. Notice 
that a recognition site found in natural protein-protein 
cornplexes contains typically of the order of 30 residues 
0, The chemical structure of the recognition site 

of the protein to be recognised, which is called target 
molecule in the following, is characterised in a coarse- 
grained approach by a discrete variable a = (cti, . . . , ctat) 
where the value of <7i specifies the type of the residue at 
positions i, i = 1, . . . , N , on the recognition site. Simi- 
larly, the types of the residues of the recognition site of 
the other protein which recognises the target are speci- 
fied by a second variable 6* = (6*^, . . . , 6'jv). In the follow- 
ing this second biomolecule is called probe molecule. On 
a coarse-grained level, the interaction of the functional 
groups across the interface is described by a Hamiltonian 



7Y(cr, 0; S) where we incorporate an additional interaction 
variable S = {Si, . . . , Sn). The variable Si takes the 
quality of the contact of the residues of the two proteins 
at position i into account, where a good contact leads to 
a favourable contribution to binding and a bad one only 
to a small contribution. A good contact may imply, for 
example, that the distance between the two residues is 
small or the polar moments of residues are appropriately 
aligned to each other. A steric hindrance on the other 
hand may result in a large distance between the residues 
and consequently one has a bad contact. The variable 
S therefore models effects that are related to minor re- 
arrangements of the side-chains of the amino acids when 
the two proteins form a complex. 

Along these general lines a first model, namely a mod- 
ification of the hydrophobic-polar (HP) model, can be 
formulated. In the HP-model only two different types 
of residues are distinguished, namely hydrophobic (H) 
and polar (P), i.e. hydrophilic, ones, so that the vari- 
ables a and 9 specify the degree of hydrophobicity of the 
residues. This restriction to the hydrophobic interaction 
is motivated by the observation that the hydrophobicity 
is a major property that discriminates the recognition 
site from other patches on the surface of a protein. Hy- 
drophobic residues are described by the variable cr,; = +1 
and polar residues by = — 1. The Hamiltonian is then 
given by 

7^Hp(a,^;5) = -e^^-^M. (1) 

i=l 

where the sum extents over the N positions of the 
residues of the recognition site and the interaction con- 
stant e is positive. It is typically of the order of e ~ 1 
kcal/mole for hydrophobic interactions The product 
—eaiOi describes the mutual interaction of the residues 
in contact across the interface. The additional variable 
Si can take on the values ±1. Thus for Si = +1 one 
has a good contact leading to a non-zero contribution to 
the total interaction energy, for Si = —1, on the other 
hand, one has a bad contact and no energy contribu- 
tion. Notice that a good contact not necessarily leads 
to a favourable energy contribution. Note also that the 
original HP-model, which has been introduced to study 
the protein folding problem [2^, does not contain an ad- 
ditional variable S to model the quality of the contact. 

The grouping of the 20 natural amino acids into classes 
of characteristic types is very important for the develop- 
ment of minimal models for the study of protein inter- 
actions. The reduction to a hydrophobic and a polar 
type and thus the use of an Ising-like model Hamiltonian 
such as (HJ) on a coarse-grained level is also justified by 
the findings in [sOl- In this work Li et al. appHed an 
eigenvalue decomposition to the Miyazawa-Jernigan ma- 
trix of inter-residue contact energies of amino acids. They 
found that the interaction matrix can be parameterised 
by an Ising-like model where the "spin variable" can take 
on different discrete values. As these values show a bi- 
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modal distribution the reparameterisation basically re- 
duces to the Ising model where the two possible values 
of the "spins" describe hydrophobic and polar residues. 
Introducing the additional variable S for the rearrange- 
ment of the amino acid side chains we end up with Hamil- 
tonian ([l]). Suggested by experimental observations the 
grouping of the amino acids into five characteristic groups 
is also widely discussed (sil . [s^l- The reduction in [32 |. 
for example, uses a distance-based clustering applied to 
the Miyazawa-Jernigan matrix. The resulting grouping 
reproduces the statistical and kinetic features of well- 
designed sequences in the protein-folding problem. The 
grouping into five different characteristic types in these 
approaches points at possible extensions of our model for 
the contact interaction. In this work, however, we restrict 
ourselves only to hydrophic and polar residue types. 

To study the recognition process between the two 
biomolecules we adopt a two-stage approach. First the 
structure of the recognition site of the target molecule 
is fixed to a certain sequence cr^^^ — {a^\ . . . , a^^^ ) of 
residues. Then this structure is learned by the probe with 
respect to some learning rules under conditions that are 
specified by a parameter (3^ ■ This leads to an ensemble of 
probe molecules of sequences 9 at their recognition sites 
with a probability P(0|cr'"') depending on the initially 
fixed target structure. To illustrate this a bit further con- 
sider a design step where learning is done just by thermal 
equilibration. The probability distribution is then tech- 
nically given by the canonical Boltzmann distribution 

P{0\<J^"^) = ^^exp(-/?„H(a(°),^;5)) (2) 

° s 

where the normalisation is the usual canonical parti- 
tion function. The design temperature acts as a La- 
grange multiplier that fixes the average energy, however, 
the parameter (Hr, may also be interpreted to describe 
more generally the conditions under which the design has 
been carried out. This first design step is introduced to 
mimic the design in biotechnological applications or the 
process of evolution in nature [33|. Note that in some 
studies of the protein folding problem [33 . [35| and the 
adsorption of polymers on structured surfaces [3a| a sim- 
ilar design step has been incorporated. 

In the second step the recognition ability of the de- 
signed probe ensemble of structures 6 is tested. To this 
end the ensemble is brought into interaction with both 
the picked target structure cr^"-' and a competing (differ- 
ent) structure a^^^ at some inverse temperature (3 which 
in general is different from the design temperature /3d- 
The free energy of the probe system interacting with the 
structure cr^"-', a = 0, 1, is then 

pio^) = J2 F{e\a^"'^)P{0\a'-"^) (3) 
e 

where F{9\<7'^"^) is the thermal free energy for the inter- 
action between a^"^ and a fixed probe sequence 9 and 



an average over the structures in the probe ensemble is 
carried out. The free energy F{9\a^'^^) is given by 

= -iln^exp(-/3H(a("\0;5)) . (4) 

^ s 

The target with the structure cr*^"-' at its recognition 
site is recognised by the probe if the associated free en- 
ergy i^^"^ is lower than the free energy for the m- 
teraction with the competing structure a^^\ i.e. in a 
mixture of equally many cr'"' and cr^^^ molecules the 
probe molecules preferentially bind to the original tar- 
get. This is signalled by a negative free energy difference 
AF(cr(o),cr(i)) = - Thus the specificity of 

the recognition process is related to the difference be- 
tween the free energy of association for the competing 
molecules. For given structures cr^^^ and tr*^^^ one can 
introduce a suitable measure Q for the structural simi- 
larity of the target and the rival biomolecule. Carrying 
out an average over all target and rival structures that are 
compatible with the specified similarity Q one can com- 
pute the averaged free energy difference of association 
AF{Q) as a function of the similarity between the target 
and the rival and therefore investigate the overall recog- 
nition ability of the model (see section [Ull below for the 
HP-model). Note that in our approach the mechanism 
which brings the two interacting molecules, in particular 
the two recognition sites, into contact is not taken into 
account, i.e., only equilibrium aspects are considered. 

In principle interactions of the residues which do not 
belong to the recognition sites with solvent molecules 
have to be considered as well. Solvation effects at the 
recognition sites and the associated entropy changes are 
also important for the association process of biomolecules 
[stI. [38|. In the coarse-grained model, however, it is as- 
sumed that all these contributions are of the same size for 
all proteins under consideration. Note also that solvation 
effects are already partially contained in HP-models. In 
addition the entropic contributions due to a reduction of 
the translational and rotational degrees of freedom upon 
forming a complex can be assumed to cancel out in the 
free energy difference AF in a first approximation. This 
requires at least that the two competing proteins are of 
comparable shape and size. 

In this work we assume that the proteins have the 
same number of residues at the interface. However, many 
protein-protein interfaces are curved with different num- 
bers of residues on the two proteins Nevertheless, we 
expect our assumption not to be crucial within the above 
simplified coarse-grained view, at least in a first approx- 
imation. As our model characterises the residues only 
with respect to their hydrophobicity one can partition 
the interface into N contacts and attribute hydrophobic- 
ities to the patches on the two proteins that contribute 
to a particular contact. Then one ends up again with 
our Hamiltonian H]). For approaches where the residue 
type is determined by additional features apart form hy- 
drophobicity correlations between neighbouring patches 
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might occur so that our assumption may become ques- 
tionable. 



III. APPLICATION TO A MODIFIED 
HYDROPHOBIC-POLAR MODEL 

The modified HP-model of the previous section, can 
again serve as an illustration of the two-state approach 
for investigating molecular recognition processes. As ^ 
does not involve any interaction between neighbouring 
residues of the recognition site of a protein, the two-stage 
approach can be worked out exactly. 

a. Design by equilibration For the HP-model the de- 
sign governed by thermal equilibration leads to the con- 
ditional probability 

P(%("))^-^-i— -/ (5) 
(2 cosh (^)) 

of the structure 6 at the recognition site of the probe 
molecule. As mentioned in the previous section the de- 
sign temperature /?d may be interpreted to characterise 
the conditions under which the design has been carried 
out. This can be illustrated using the present exam- 
ple of the HP-model. In the HP-model the value of ai 
or 9i, respectively, basically specifies the hydrophobic- 
ity of the residue at position i. The total hydrophobic- 
ity of the recognition site of the target molecule is then 
= J2,(7f\ From relation © one can calculate the 
average hydrophobicity (iJo) of probe structures: 

(H.) = EE^^^C^I-*"') = tanh f'-f) . (6) 

k ^ ^ 

Thus the Lagrange parameter can be used to fix the 
average hydrophobicity of the designed probe ensemble 
which is achieved by controUing the supply of hydropho- 
bic residues during the design procedure. 

The probability distribution ^ for the designed struc- 
tures 9 expHcitly depends on the structure cr^"^ of the 
recognition site of the fixed target molecule. For the HP- 
model a design under ideal conditions, i. e. 1//3d = 0, the 
structure 9 would simply be a copy of cr'-^-' . However, for 
non-ideal conditions with /?d < 00 "defects" appear in the 
design procedure and the obtained structure 9 deviates 
on average from cr^^). This deviation can be quantified 
by the complementarity parameter 

/C(^,a("))=^e,af. (7) 

i 

The possible values of /C range from —A^ to A'' in even 
steps. A value /C(0,ct^°^) close to N means a high com- 
plementarity and the interaction of the probe structure 
9 with (t(°^ can lead to a large enough energy decrease 
so that a complex can be formed. On the other hand 



a value of ]C{9,(7^^'>) much less than N signals a poor 
match between the two recognition sites and therefore it 
is unHkely that a complex is stabilised. 

The probability distribution P(6'|(T^°^) can be con- 
verted to a distribution function for the complementarity 
leading to the probability 

P{K) = 5]P(0|aW)<5^(,,,(o,),^ (8) 
e 

to have a complementarity parameter K in the designed 
ensemble. The quality of the design can now be mea- 
sured by the average complementarity of the designed 
structures 9 which is given by 

(K) = J2 KP^^^) = ^ta^^^ (^) (10) 

for the modified HP-model. For large /3d one gets a probe 
ensemble which is fairly complementary to the fixed tar- 
get structure. Thus large values of /S^ correspond to good 
design conditions, an observation which can already be 
deduced from the interpretation of (3-d as an inverse tem- 
perature. In the hydrophobicity interpretation discussed 
above large values of /3d signify comparable hydrophobic- 
ities of the target and the probe molecule. 

6. Recognition ability The recognition ability of the 
probe molecules is tested by comparing the free energy 
of association with the target structure ct'"' and a com- 
peting molecule with the structure a^^^ at its recognition 
site. For the HP-model ([1]) with its two different types 
of residues, one can introduce the similarity parameter 

Q=E-r^-r^- (11) 

i 

For Q close to its maximum value N the competing 
molecule has a recognition site that is almost identical 
to the one of the target molecule. In terms of the simi- 
larity parameter Q the free energy difference is given by 

AP(Q) = -i£A^tanh(^^^ (A-g). (12) 

The free energy difference is always negative as soon as 
the recognition site of the competing molecule is not iden- 
tical to the one on the target molecule. In equilibrium 
the probe molecule therefore binds preferentially to the 
target molecule and thus the target molecule is specifi- 
cally recognised. The difference in free energy increases 
for an decreasing similarity parameter Q. Note also that 
the slope of the free energy difference depends only on the 
conditions under which the design of the probe molecules 
has been carried out. 
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IV. ROLE OF COOPERATIVITY IN 
MOLECULAR RECOGNITION 

Cooperative effects play an essential role in many bio- 
logical processes such as the catalysis of biochemical re- 
actions by enzymes. Cooperativity is presumably also 
very important for molecular recognition processes [39| . 
In general cooperativity means that the binding strength 
of two residues depends on the binding interactions in 
the neighbourhood of the two residues in contact. Thus 
the energetic properties of residues when interacting with 
other residues cannot be inferred by considering them 
isolated from the local environment. This has implic- 
itly been done, however, in the modified HP model ^ 
where the interaction constant e has been attributed to 
the residue-residue interaction independently of the cor- 
responding local environment. 

In this section the modified HP-model of the preceed- 
ing section is extended to incorporate the effect of coop- 
erative interactions on molecular recognition. Note that 
in reference [s^l it has been argued that cooperativity 
should be incorporated on a residue-specific level. 

During the association process rearrangements of the 
amino acid side chains are observed. Thus in the idealised 
model applied in this work cooperative effects stem from 
the behaviour of the variables Si. A possible extension 
of the modified HP-model which takes cooperative inter- 
actions into account is given by 



n{(T,0;S) 



^ 1 

E- 



1=1 



(13) 



The first sum describes again the hydrophobic interac- 
tion whereas the second sum represents the additional 
cooperative interaction. It extends over the neighbour 
positions of the residue at position i. For a fixed « on a 
square-lattice the sum includes therefore four terms. The 
interaction coefficient J is positive for cooperative inter- 
actions and negative for anti-cooperative interactions. To 
get an impression of its effect consider the design step. 
Suppose that at position i one has a hydrophobic residue 
on the target molecule. Then the first term in the HP- 
Hamiltonian lfT3|) favours that a hydrophobic residue ad- 
sorbs there with a good contact Si = +1 on average. 
Suppose now that on one of the neighbouring positions j 
of i on the target one has again a hydrophobic residue. If 
a hydrophobic residue gets adsorbed at the correspond- 
ing position of the probe structure a good contact with 
Sj = +1 is favoured by the hydrophobic interaction term 



in IjlSp . But then the second cooperative term leads to 
an additional energy decrease for J > 0. If on the other 
hand a polar residue shows up at the position j on the 
probe molecule the hydrophobic contribution in ifl3|) tries 
to avoid a contact, i. e. 5"^ = —1, on average, which then 
leads to an unfavourable energy increase due to the coop- 
erative term. The quality of a contact thus couples to the 
quality of the contacts in the neighbourhood of a residue. 
For a positive constant J the cooperativity is therefore 



expected to enhance the fit of the molecules at the inter- 
face resulting in an increased average complementarity 
compared to an interaction without cooperativity. Sim- 
ilarly, one expects an increase in the recognition speci- 
ficity. In the subsequent paragraphs these suggestions 
are investigated for cooperative interactions. 

Note that Oi (and thus the product (JiOi) in Hamil- 
tonian lfT3|) is a random variable whose distribution is 
obtained by the design step. The energy function (fT3|) 
describes therefore a random field Ising model. Contrary 
to the models mostly investigated in the literature (e. g. 
[iol. l4l|) the distribution function of the random variable 
aiOi is not symmetric with respect to a sign-refiection. 



A. Limiting case of dominant cooperativity 

The case where the cooperative contribution to the 
total energy dominates can be investigated analytically. 
Consider the situation where J ^ Ne. The cooperative 



term ~JJ2{ij) ^i^j the Hamiltonian lfT3|) has discrete 
energy levels —ANJ, —i{N—l)J, . . . , +4:NJ for a recogni- 
tion site with a rectangular geometry where each residue 
has four neighbours. The hydrophobic interaction term 
12i ^i^i has also discrete levels ranging from —Ne 
to -^Ne. For the above assumption J ;» Ne the global 



rough structure of the spectrum of the Hamiltonian l|T3|) 
is basically determined by the cooperative contributions. 
The hydrophobic interaction of the residues in contact in- 
troduces only small variations about the main energy lev- 
els with two adjacent ones being separated by an amount 
of 4 J. For a small temperature, i. e. a large /?, the statis- 
tical behaviour is dominated by the twofold degenerate 
lowest energy state of the cooperative interaction term 
with all Si being either in the state -1-1 or in the state 
— 1. Due to this reduction of the phase space of possible 
S configurations the two-stage approach can be worked 
out analytically. The dominance of the cooperative term 
leads to the new effective Hamiltonian 

UV -^-±leY^o,6,-mj (14) 

i=i 

where the scalar variable s can have the values ±1. The 
design step now yields the probability distribution 



P(^^|a^")) 



1 + cxp ePoJ^^^i^ 



(0) 



2^ + (2 cosh (£/?„)) 



N 



(15) 



for the structure of the recognition site of the probe 
molecules. The corresponding distribution of the com- 
plementarity between the structures cr^") and e is 



m) - 1 



N 



1 + exp ie/3^K) 



(N + K)J 2N + (2 cosh (eP^)) 



(16) 



The distribution function for the complementarity pa- 
rameter K can again be used to calculate the average 
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complementarity of the designed molecules. For large N 
(for which the term 2^ in the denominator of l|16p can 
be neglected as long as /?d 0) one obtains 



{K) 



iVtanh (e/Jn) ■ 



(17) 



In the situation of a dominating cooperative interaction 
the average complementarity is increased compared to 
the case where cooperativity is absent. This suggests 
that values of the cooperativity interaction constant J 
comparable to the size of the hydrophobic interaction 
constant e might also enhance the quality of the design 
step. This question is investigated numerically in the 
subsequent paragraph II V Bl 

In the second step the designed probe ensemble inter- 
acts with the chosen target structure cr^^) and a compet- 
itive one cr^^^ . The associated free energy averaged with 
respect to the distribution of the structures 9 of the probe 
molecules is in general given by 



-Ij^ln (^1 + cxp (^£/3^0,a(")^^(18) 



1 + exp e/3D I]6'iCT; 



(0) 



2^ + (2 cosh (e/?!,)) 



N 



(19) 



In case of a large number of residues ^ 1 again further 
progress can be made analytically. Consider first the free 
energy of association of the system with the fixed target 
structure. In this case the sum over the possible struc- 
tures of the designed probe molecules can be converted 
into a sum over the complementarity parameter K: 



^i^ln(l + exp(£/3i^))P(X). 



(20) 



K 



The dominant contributions to this sum arise from the 
values of K close to the maximum of the distribution 
P{K). For suitably large /?d this maximum, however, 
occurs for K ~ 0{N) and thus it is large as well (compare 
relation lfT7|) ). Therefore, in the limit N ^ 1 one can 
use the replacements 1 -I- exp{Pj:,eK) « e:icp{l3oeK) and 
ln(l -I- exp{f3eK)) w PeK. Using these approximations 
the summation in ll20l) leads to 



F 



(0) 



03 



K 



-eNt(iiih{eP^). 

(21) 

A similar conversion cannot be applied to the summa- 
tion over the designed molecules in the calculation of F'^' 
as both ^io-.l"^ and OiO-^^^ terms appear. Defining the aux- 



iliary variables ki := OiCjf'' and q., 
that (af ^)2 = 1 the free energy F*^^) is explicitly given 

by 



(0) (1) J 

a\ (Ti and notmg 



i^(i) 



1 + cxp e/3^/c,g, (22) 



1 exp ef3^ J2 ^1 



■2^ + (2cosh(£/3^))~' 



(23) 



The variable ki specifies the local complementarity be- 
tween the target cr(°) and a particular probe structure 9. 
Using again the observation that the dominant contribu- 
tions originate from values of large K = J2i h one can 
use again the replacement 1 -I- exp{f3osK) « exp{(3j)eK). 
The logarithmic factor in (|22| gives large contributions 
if the majority of the qi variables is in state -1-1. Thus, 
in the limit of Q = J2i 1i ^ 1 the sum in ((22l) can be 
worked out and one obtains 



F 



(1) JV,Q»1 



-eQ ta,nh{ef3o). 



(24) 



The free energy difference in terms of the similarity Q of 
the competing molecules cr^°^ and cr'-^-' is now given by 



AF 



JV,Q»1 



-staiih{el3^){N - Q) 



(25) 



for positive and large Q. Again one has a linear depen- 
dence in the vicinity oi Q = N . This can be compared 
to the corresponding result lfT2|) for the situation with 
J = 0. The cooperativity increases the slope of the free 
energy difference and thus the recognition ability of the 
designed probe ensemble is increased by cooperativity. 

In the Hmit Q = J^i^i ^ ~1j on the other hand, 
almost all qi take on the value —1 and thus hqi is close 
to —N for those ki leading to the dominant contributions 
in 1I22I). One therefore has 



In I 1 + cxp [ £/3^^ kiC 



A'»l,Q-C-l 



cxp i~ePN) 



(26) 



for the logarithmic factor of the dominant terms in (|22)) . 
The free energy of association of the probe molecules with 
the rival molecule is thus F^^^ ^ 0{e~^) so that 



AF 



1 i7(0) ^ ^sNtSilM^Pn)- 



(27) 



In the Hmit Q <^ ~1 the free energy difference is thus 
independent of the similarity parameter Q between the 
target structure and the rival structure. 

For a similarity parameter |Q| ^ C(l) one expects de- 
viations form the behaviour for large \Q\. For the free 
energy difference per residue AF/N as a function of the 
similarity per residue Q/N, however, the deviations show 
up for similarities Q/N of the order of 1/iV. The free en- 
ergy difference per residue will thus develop a kink at 
Q/N = in the asymptotic limit of ^ 00 so that 
it is given by expression (|25)) for positive Q/N and by 
relation (|27| for negative Q/N. The range of values of 
the similarity per residue where deviations between the 
free energy for a system with finite and the asymptotic 
result show up is shrinking for increasing A^. 
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B. Numerical results for arbitrary cooperativity 

The above analysis of the limiting case J ;2> Ne with a 
dominant cooperative interaction suggests that coopera- 
tivity enhances the quality of the design step and eventu- 
ally increases the recognition ability. In this section this 
suggestion is investigated more closely for cooperativity 
constants J which are of the order of the interaction con- 
stant e of the hydrophobic interaction term in (fT3| . 

c. Design. For non-zero, but finite values of J it is 
not possible any more to solve the model analytically. 
Therefore, the two-stage approach has to be carried out 
numerically. To this end the density of states for the 
design step is calculated as a function of the energy and 
the complementarity parameter for a fixed cooperativity 
J. The density of states is generally given by 



(28) 



for a fixed target structure a^^^ . The density of states 
flj{K; E) is thus the number of (6*, S) configurations that 
have energy E when interacting with the target and a 
complementarity K of the probe molecule 9 to the target 
recognition site. In general the density of states depends 
additionally on the configuration (t^°^ of the recognition 
site of the target molecule. However, for the HP-model 
(fT3|) the density of states has no explicit dependence on 
cr*^"^ as the variables 9i can be transformed to the aux- 
ihary variables ki := cr,|°^0i, which have the same phase 
space as 6*;, so that cr'"' does not appear any more. 

The density of states can be calculated directly using 
efficient Monte Carlo algorithms 43, j3|- In this work 
the Wang-Landau algorithm has been applied. Once the 
density of states is known the probability distribution of 
the complementarity is basically obtained by calculating 
the Laplace transform of Hj giving up to a normalisation 

PjiK; 13^) ^ "■^(^'' ^) cxp(-/3oS)- (29) 

E 

From this distribution function one can calculate the 
average complementarity {K) (J) = J2k ^ji^'i l^o)K 
which is shown in figure [TJ The calculations have been 
carried out for a square-lattice geometry with N = 256 
residues. We have checked that the curves show only mi- 
nor finite-size effects for recognition sites of realistic sizes 
with N ~ 0(30) (see [Hi). The qualitative findings dis- 
cussed in the following are independent of the number 
N of residues involved in the interface. It is visible that 
cooperativity increases the average complementarity of 
the probe molecules for large enough values of /?d • For a 
parameter value of the order of £/3d — 1 a small change 
in the cooperativity J leads to a large difference in the 
average complementarity. Therefore, small changes in J 
can have a large impact on the quality of the design step. 
As the typical energy e of a non-covalent bond is of the 
order of 1 kcal/mole this regime corresponds indeed to 
physiological conditions. 




FIG. 1: Average complementarity per site of the designed en- 
semble for the HP-model iflSll for different values of J. The 
lower dashed curve corresponds to J = 0, the upper dashed 
line represents the limiting case J — » oo (for large A''). The 
solid curves in between from the bottom up correspond to the 
values 0.1, 0.25, 0.5, 0.75 and 1.0 of J in units of e. The dot- 
ted curve shows the result for a system with additional next 
nearest neighbour cooperativity with J„„„/e = J„„/e = 0.1. 
The inset compares the numerical results (solid lines) with the 
mean field findings of section llV Cl fcirclesl for cooperativities 
0.25 and 0.5. The dashed curve corresponds again to J = 0. 



The Hamiltonian ifTSl) contains a cooperative term 
where the quality of the contact couples to the contact 
variable at the neighbouring sites. This limitation to 
nearest neighbour interactions can be relaxed by allow- 
ing additional couplings to sites that are further away. 
As long as the range of the cooperative coupling is finite, 
however, we expect, that the average complementarity 
(K) is qualitatively similar as for the model (fTSl . For the 
system with nearest and next nearest neighbour interac- 
tions (with the same constant J) the case of dominant co- 
operativity can be treated as above (section FlV Ap leading 
to the same effective Hamiltonian (fT4|) with the irrelevant 
constant replaced by ~8NJ. So the same Hmiting curves 
for (K) as well as AF result. However, the additional in- 
teractions have the consequence that the maximum effect 
of cooperativity will already show up for smaller values of 
J. This is shown in figure[l]for the model with additional 
next nearest neighbour cooperativity. 

Before analysing the recognition ability for J 7^ con- 
sider briefly the influence of an anti-cooperative inter- 
action with J < in Hamiltonian (fT3)) on the average 
complementarity (K). From the discussion of the effect 
of the cooperative term within the design step one may 
expect that anti-cooperative interactions should decrease 
(K). For a probe molecule with a high complementarity 
to the target molecule all Si tend to be in state -1-1 to 
ensure good contacts and thus a large energy decrease 
due to the hydrophobic interaction. However, the anti- 
cooperative term then leads to an energy increase so that 
the two contributions to the Hamiltonian lfT3|) compete 
with each other. Two different regimes can now be ex- 
pected. Large values of the parameter favour struc- 
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tures 9 that are highly complementary to the target a^^\ 
For > J > — e/8 the hydrophobic interaction term is 
dominant leading to a majority of good contacts Si = +1 
and thus {K) is expected to become N for increasing /3d . 
However, if J < —e/8 the second anti-cooperative term 
dominates leading to an alternating structure of good and 
bad contacts where the Si of two neighbouring positions 
have different signs. Note that in such a situation the 
direct hydrophobic-polar interaction contributes a maxi- 
mum favourable energy —e/2 per site whereas the coop- 
erative term gives the maximum contribution 4 J per site 
giving the cross-over value J = —e/8 for the considered 
square geometry . For one half of the residues one there- 
fore has preferably good contacts so that the residue on 
the probe molecule is of the same type as the one on the 
target molecule on average. For the other half of posi- 
tions, however, one has bad contacts. For those positions 
the hydrophobic interaction term in ([13)) does not con- 
tribute and the probabilities of the residue on to be 
hydrophobic or polar at such positions are equal. For 
J < —e/8 one thus expects that (K) tends to N/2 for 
increasing /3d- These expectations are indeed confirmed 
by numerical investigations as shown in figure [2l 




FIG. 2: Average complementarity of the probe ensemble for 
the anti-cooperative HP-model l|13p with J < 0. The dashed 
line represents J = 0, the solid curves from top to bottom 
correspond to the values -0.1, -0.2 and -0.5 of J in units of e. 



In the general discussion of the extended model ifTSj) 
it has been argued that the cooperative term will in- 
crease the effective contribution of a residue-residue con- 
tact at the interface between the two biomolecules. To 
get an impression of this increase one can define an ef- 
fective residue-residue interaction constant e^ff{f3o,J) := 
(H(J)) / {Hup) by considering the average interaction 
energy of the probe ensemble with the target molecule 
for different values of the cooperativity J. Figure [3] shows 
that this effective interaction constant is indeed increased 
by the cooperative term in the Hamiltonian ifTS]). 




FIG. 3: Effective interaction constant Eeff as defined in the 
main text as a function of the temperature for the model 
ifTSjl . The solid curves correspond to values 0.25, 0.5, 0.75 
and 1.0 of J (in units of e) from the bottom up. 



quantified by the free energy difference 



6^^ (0, (i)AF(fT(o),aW)) 



J„ (0) (1) / 

Q^^'^l /<t(0) 



(30) 



d. Recognition ability. The knowledge of the density 
of states allows the calculation of the recognition ability 



for the association of probe molecules with the two struc- 
tures CT^"' and cr(^^ The results are shown in figure[4]for 
different values of the J. For comparison the free en- 
ergy difference for the system with additional next near- 
est neighbour cooperativity is shown as well. An increase 
in J increases the free energy difference and therefore the 
recognition specificity of the probe molecules. For a value 
of J of the order e the maximum effect of cooperativity 
has already been reached for the considered temperature 
values /3d = /3 = 1-0. Thus, the expected increase of the 
recognition ability by cooperativity for constants J ~ e 
is indeed confirmed by the numerical results. 

To study the infiuence of different cooperativities on 
the recognition ability in a more direct way the follow- 
ing approach can be adopted. The cooperativity already 
infiuences the design step and optimises the probe ensem- 
ble with respect to the original target structure as can be 
seen by the dependence of {K) on the J. This better op- 
timisation infiuences the testing step as well. In order to 
investigate the pure infiuence of the cooperative interac- 
tion on the recognition ability more closely one can use 
probe ensembles where the average complementarity is 
fixed to some values Kq for different J. This can be done 
by carrying out the design of the probe molecules at dif- 
ferent design temperatures such that {K) (/3d, J) ~ Kq. 
The probability distributions obtained when this addi- 
tional constraint is appHed are then used to calculated the 
difference of the free energy of association of the probe 
molecules with both the target and the rival molecule. 
The results are shown in figure [5] for recognition sites 
with = 64 residues. Again it can be seen that an in- 
crease in the cooperativity J increases the free energy 
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FIG. 4: Free energy difference per site of the association of 
the probe ensemble with the two competing molecules as a 
function of their similarity for different cooperativities J in 
l|13p . The upper dashed line corresponds to J = 0. The 
lower dashed line describes the limiting case J — » oo in the 
limit of large A'^ f section [IV Ap . The solid curves from top 
to bottom correspond to the same values of J as in figure [T] 
The dotted curve shows the result for a system with additional 
next nearest neighbour cooperativity with J„„„/e = J„„/e — 
0.1. The parameters (3-d and (3 are both 1.0. 



difference for a fixed similarity Q/N between the target 
and the rival biomolecule. The dashed lines in figure [5] 
represent the free energy difference for J = and for the 
asymptotic regime J ^ oo with iV 1. For large Q/N 
and large J the free energy difference is already well rep- 
resented by the asymptotic result. For a cooperativity 
J ~ e the maximum effect is thus already achieved. 




-0.5 

Q/N 

FIG. 5: Free energy difference as a function of the similarity 
for different cooperativities J (with (3 = 1.0) where the probe 
ensemble has been designed to have a fixed {K) /N = 0.4. 
The upper dashed lines corresponds to J = 0, the lower one 
describes the limiting case J ^ oo (and large A*'). The values 
of J/e in lfT3|l are 0.25, 0.5, 0.75 and 1.0 for the solid curves 
from top to bottom. For the dotted line J/e = —1/2. 



fixed structure cr^"^ of the recognition site of the target. 
As the similarity parameter Q is minimum, the compet- 
ing molecule has the structure ct^^' = — cr^^) at its recogni- 
tion site. The free energy difference of association is then 
given by AF{-N) = -i Pj(0|a(o))(ln Z(0|a(")) - 

\nZ{e\ - cr("))). The partition function Z(6i|cr(i)) = 
Z{6\ — cr(°)) related to the rival structure expHcitly reads 



Z{e\-a^">)^e-^^'''^ (31) 



(33) 



where a transformation Si —Si has been used for the 
last equality. Note that the phase space for 5" is the same 
as for the variable S. Thus the free energy difference at 
Q = —N is generally given by 

AF(Q = -TV) = -eJ2PjiK;PD)K = -e (K) (J). 

K 

(34) 

As the average (K) is fixed to the value Kq for different 
J the free energy difference is the same for all J. 

For the HP-model with pure hydrophobic interactions 
the free energy difference is independent of the conditions 
under which the recognition ability is tested. It is only 
determined by the design conditions (compare relation 
(fT2l) ). For the extended HP-model l|13p with cooperative 
interactions this is no longer the case. Apart from the 
design conditions, encoded in the Lagrange parameter 
the free energy difference depends on the f3 which 
specifies the conditions for the testing step. In figure [6] 
the free energy difference is shown for different values 
of /3. The cooperativity constant is fixed to be J/e = 
1/2, the design temperature P^, is chosen to have {K) = 
N/2. For increasing parameters P the absolute value of 
the free energy difference is increased. For the minimum 
similarity Q = —N the free energy difference becomes 
independent of /3. Its value at the minimum similarity is 
only determined by the design conditions and is given by 
AF{Q — ~N) = — e {K) as has been shown above. 

The independence of AF{Q = —N) of the testing tem- 
perature /5 is a result of the symmetry of the underlying 
model (fT3| . This symmetry is broken by introducing a 
field like term — J^i J'^i to the energy. It is expected that 
there is some bias towards good or bad contacts leading 
to a such an additional field with 7 7^ 0. For positive 
fields 7 the recognition ability is again expected to be 
increased with respect to the situation where 7 vanishes. 
This is shown by the dotted lines in figure [H 



C. Mean field theory for arbitrary cooperativity 



For the minimum similarity parameter Q = —N the 
free energy difference at fixed Kq is independent of the co- 
operativity J (compare figure [5]). To see this consider the 



After having analysed the infiuence of the cooperative 
terms in the previous paragraphs by means of an asymp- 
totic analysis and Monte Carlo simulations we briefiy 
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FIG. 6: Free energy difference as a function of the similarity 
for fixed J/e = 1/2 in l|13p and different testing temperatures 
(5. The probe ensemble has been designed to have {K) /N — 
0.5. The solid curves correspond to the (3 = 0.5, 0.75, 1.0 and 
1.25 from top to bottom. For the dashed curves /3 = 1.25 and 
an additional field 7 has been applied, namely from top to 
bottom 7/e = 0.05, 0.1 and 0.15. 



sketch how a mean-field treatment can be carried out [45|. 
The discussion will be restricted to the determination of 
the averaged complementarity. As already mentioned the 
variable (Ji9i acts as a random field in lfT3|) and therefore 
techniques from the theory of disordered systems have 
to be applied in the mean field treatment (see, for ex- 



ample, [4fl, |4l|). Thus the auxihary variable ki = 9ia. 



(0) 

i 

which has been introduced in IIV Al and specifies a com- 
plementarity configuration k = (fci,...,fc7v) is used in 
the following. The mean-field approach consists of two 
steps, namely an equivalent neighbour approximation of 
the cooperative interaction term and an asymptotic eval- 
uation of the partition sum for large A^. The equivalent 
neighbour approximation of the Hamiltonian (flSl) reads 



S, 



(35) 



We aim at a calculation of the averaged complemen- 
tarity K — {J2i^i) containing a thermal average with 
respect to the interaction variable 5" and an average 
over the possible complementarity configurations k of the 
probe molecules with respect to the target. The ther- 
mal averaged leads to the distribution P{9\cr'^^'>) of probe 
molecules and thus to a distribution P{k) of the com- 
plementarity configuration itself. Consider first the ther- 
mal average with respect to S. The variable x :— J^i 
appears quadratically in ((35|) . By introducing an ad- 
ditional auxiliary variable y it can be linearised in the 
argument of the Boltzmann factor in the partition sum 
Z{k) = J2s 6xp(— Z^'Wen) with the help of the identity 



+00 



/ « 2 
exp X 




exp 



Na 



-y +axy 



often called Hubbard-Stratonovich transformation in the 
literature. Note that the distribution function P{k) of 
the complementarity configuration is determined by Z{k) 
up to the normalisation. The summation over S can then 
be carried out leading to 

Cx -1-00 
yE^^J / dycxp(^(y,fc)) (37) 



with 



Ay, k) = -^^y^ + ^ In cosh (pjy + ^fc, 



(38) 



In the asymptotic limit of large A'^ the integration over 



the auxiliary field y in (137 
Laplace method (e. g. [4 



Z(k) ^ exp 



can be carried out using the 
47|). This gives 



(39) 




aside from irrelevant factors. The mean field yo is deter- 
mined by the saddle point equation 



1 / /3e 

^^^^ I /5^yo + 



(40) 



Note that the mean field expHcitly depends on the local 
complementarity configuration k. These two equations 
can be used to carry out the configurational average over 
all k to obtain the averaged complementarity (K) by not- 
ing that a particular configuration k contains A''^+^ sites 
with ki = +1 and AT^"' ones with ki = —1. The par- 
tition function Z (and thus the distribution function P) 
as well as the mean field yo are therefore only functions 
of K'-^\ The mean field yo{K^^\ A'*^^^), for example, is 
then given by 



yo 



Ki+) 
N 



■ tanh f3Jy{ 



(36) 



^^+^tanh(^/3Jyo-f 

(41) 

The average over k can thus be converted to an av- 
erage over (A'^+\A'(^)) so that the complementarity 
(AT) = ^A''+' — A"'^') can by worked out using a com- 
puter algebra program. The results are shown in the inset 
of figure [U together with the Monte Carlo findings dis- 
cussed in the previous paragraph. The mean field curves 
behave qualitatively similar as the Monte Carlo curves. 
Using a similar decomposition of the similarity config- 
uration Qi = (T-°'(T-^^ between the target and the rival 
structure into positive contributions and negative 

ones Q^~^ one can calculate the averaged free energy dif- 
ference AF{Q) (compare relation l(30|) ). The resulting 
curves show again the same qualitative behaviour as the 
results from the Monte Carlo simulations. 



11 



V. COOPERATIVITY COUPLING TO RESIDUE 
STRUCTURE 

The importance of cooperativity in biological situa- 
tions was emphasised at the beginning of section IIVI 
In Hamiltonian (fT3| an additional cooperative term has 
been introduced which, however, does not couple to the 
residue distributions on the recognition sites of the two 
molecules in contact. In general the additional coopera- 
tive interaction terms might also couple to the structures 
a and 9 of the target and probe molecule, respectively. 
One possible coupling is given by the Hamiltonian 

n{a, e-s) = -Y^ ( ^^^^ + JY^^^^^A (42) 

i = l \ is / 

The sum in the second term extends over the neighbour- 
ing positions is of the position i on the interface. Again 
the cooperative term will lead to an additional energy 
contribution depending on how the side chains are rear- 
ranged in the interface. In case of a favourable direct 
energy contribution from the hydrophobic interaction at 
site i described by the product (Ji9i the cooperative term 
rewards good contacts like in the Hamiltonian (fT3|) . How- 
ever, in (fT3|) two neighbouring bad contacts due to an 
unfavourable hydrophobic-polar interaction are also at- 
tributed a favourable cooperative contribution. This is 
no longer the case in Hamiltonian l(42|) as the sign of the 
cooperative energy contribution now depends on the sign 
of the hydrophobic interaction energy at position i on the 
interface. It is thus expected that the cooperative terms 
in (|42| lead to a more favourable cooperative contribution 
than those in Hamiltonian (fT3|) . The cooperative terms 
in the Hamiltonians (|42|) and l|42|) are only two possible 
ways to take into account cooperativity, corresponding 
in our modelling to mutual interaction of neighbouring 
variables Si, other extensions are possible as well. 

As already remarked the variable ai9i in ([13)) is ba- 
sically a random field the distribution of which is de- 
termined by the design step. The model (fT3l) is thus a 
random field Ising model where the random field ai9i 
is asymmetrically distributed. In Hamiltonian l(42|) this 
random variable now also couples to the exchange con- 
stant J of the interactions between neighbouring vari- 
ables Si and thus the exchange constant also becomes 
a random variable. The model (|42| is thus an Edward- 
Anderson-like model in a random field with an asymmet- 
rically distributed exchange constant J(Ji9i. 

The two stage approach to obtain the recognition abil- 
ity can now be carried out numerically for the model 
(|42| by calculating again density of states Vlj{K]E) by 
a Monte Carlo simulation. The results for the averaged 
complementarity of the probe molecules and the free en- 
ergy difference are depicted in figure [71 One observes a 
similar qualitative behaviour as the corresponding curves 
for the model (fT3|) . Again, it is found that an increase 
of the parameter J increases the quality of the design 
step in the sense that the probe molecules are better op- 



timised with respect to the target biomolecule indicated 
by an increase of {K) for higher values of J. Similarly, 
the recognition ability measured by the free energy differ- 
ence Ai^ = i^(o) - for a given similarity Q between 
the target and the rival grows for increasing J. 




FIG. 7: Averaged complementarity of the probe ensemble de- 
signed according to the model lf42|l. The dashed curve repre- 
sents J = 0, for the solid curves J/e — 0.2,0.3 and 0.4 from 
bottom up. For comparison, the dotted line depicts the corre- 
sponding curve for the model ifTSll with J/e = 0.2. The inset 
shows free energy differences as a function of the similarity Q 
for the same set of parameters, where /3d is chosen to have 
{K) /N = 0.4 (with (3 = 1.0 for each curve). 



VI. CONCLUSIONS AND OUTLOOK 

We have presented coarse-grained models to study 
properties of molecular recognition processes between 
rigid biomolecules. The development of the models has 
been motivated by experimental investigations on the 
biochemical structure of the interface of protein com- 
plexes. A two-stage approach containing a design of 
probe molecules and a testing of their recognition ability 
has been adopted. This approach has been used to inves- 
tigate the role of cooperativity in molecular recognition. 
The coarse-grained models capture the effects of cooper- 
ativity on a residue specific level. The necessity of such 
an approach has been pointed out in the literature [39| . 
We have shown numerical results and compared them to 
analytic results obtained in the asymptotic limit where 
cooperative interactions dominate over direct hydropho- 
bic interactions between the residues and in the mean- 
field formulation of the models. It turned out that a 
small contribution due to cooperativity can already sub- 
stantially infiuence the recognition ability, corroborating 
the suggestion that cooperativity has a considerable ef- 
fect on the recognition specificity. Two possibilities to in- 
clude cooperative interactions have been explicitly anal- 
ysed leading to similar qualitative results. We note in 
passing that the proposed coarse-grained model can re- 
produce qualitatively the experimental observation that 
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in antigen-antibody complexes, which require a relatively 
high binding flexibility, a small number of strong non- 
covalent bonds across the interface seems to be favoured 
compared to a situation with many but rather weak 
bonds. The details are published elsewhere (28| . 

The proposed approach to study molecular recognition 
with coarse-grained lattice models can be extended in 
various ways. Apart from working with refined models, 
which capture more details of the actual physical interac- 
tions across the interface of the two biomolecules, the de- 
sign step can be modified to mimic natural evolution in a 
more realistic manner. The presented analysis considered 
on the level of the target and the rival molecule is basi- 
cally a single molecule approach, although the molecules 
are described in a very coarse-grained way. The infiu- 
ence of the heterogeneity of the mixture of target and 



rival molecules encountered in real physiological situa- 
tions as found in a cell, for example, can be incorporated 
in our analysis. To this end ensembles of targets and 
rivals differing in certain properties as for example corre- 
lations and length scales have to be considered. A recent 
study indeed indicates that the local small-scale struc- 
ture related to the distribution of the hydrophobicity on 
the recognition site of the biomolecules seems to play a 
crucial role in molecular recognition (2^ . 
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